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ABSTRACT 


The need for secure voice communication systems is 
increasing both in the civil and military arenas. 
Coupled with this is the need for conserving 
bandwidth, increasing performance, and reducing costs. 
Currently used secure voice methods are relatively 
antiquated and do not provide desired performance and 
bandwidth conservation without incurring increasing 
costs. A new system, proposed herein offers bandwidth 
reduction, increased performance, and GdecreaSing costs 
While using modern digital techniques aS opposed to 
analog techniques. The proposed system, known as 
VOCOM, operates in existing voice bandwidths using 
existing equipment, and offers a higher level of 
privacy and security while at the same tire 
Simplifying software handling. AdG we vO tal iy Go Ene 
proposed system offers the user real-time operation to 


enhance critical decision-making. 
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I. LE RODUCT LON 
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The need for Security in voice communications systems is 
steadily increaSing. Sensitive information is constantly 
being passed between people which, if intercepted by 
unintended or undesireable elements, could significantly 
affect the original purpose of the communication. AS a 
conseguences, the communicating parties stand to lose money, 
poSition, status or crucial elements of their own livelihood 
miewucing National or personal security. Some protection 
methods are relatively antiguated and ineffective, Compared 
to the CurLent electronic NStdter Ob Stine Yatra. 
intercepting uncovered transmissions as well as ccvered 


mmeansmissions. 


Voice security requirements reach into wide areas of 
business, civil and military communications. The need for 
VOuce Security iS obvious in credit, stockmarket, and 
banking orerations where information transferred by voice on 
uncovered lines demands confidentiality. These systems are 
Vuinerable to attack by eavesdroppers and intelligence 
gatherers seeking to sabotage or threaten communications. 
Law enforcement communication systens clearly need 
protection of voice communications. Although many of the 
police systems empioy coding schemes, determined 
Meer ceptors can, by patient association, breax the spoken 


codes. 


Mie tne tre litary voice communication applications, 
elaborate daily-changing coding schemes have been developed 
aS a sophisticated method of ensuring communications 


security. However, determined or hostiie agents need only 





monitor and iink several communications of the same subject 
matter to discern a pattern of the transmitted content. 
Elaborate and complex encryption schemes employed by the 
military necessitate computers and space-consuming software 
such aS publications and decoding tools. Because of its 
large volume cf usage, cost, size, and physical space 


requirements are usually justitied by economies of Scale. 


There are smaller, local applications of voice security 
that dc net require elaborate equipment, expense, Or 
physical space requirements of larger-scale operations. The 
Same protection afforded the larger systems, of course, would 
be desirable in the smaller systems but the hardware and 
software compiexity is plone justified. Congressional 
Criticism of insecure voice equipment during the Viet Nam 
conflict documented a need for voice coding systems 


applicable tc local, smalier needs. [23] 
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Voice coding systems are of two general types: analog 
and digital. Digital systems convert voice Signals directly 
into a number or digit stream, transmitting these bits in 
place of a voice Signal. Sxisting digital systems require 
more than the nominal 3000 hertz pandwidth available in most 
telephone applications. A digital system is therefore 
referred to aS a wideband system. Wideband is defined as 
being several times the unencoded base band Signal width 
compared to a narrow band of approximately the same 
bandwidth [16]. The wide band virtually eliminates retrofit 
compatibijity without extenSive revork of existing 
communications systems. P21) Daigatal characteristics are 
well suited to systems employing pseudo-random encoding data 
streams. [8] Datga ta) systems typicaily have 
pmemeg-~to-digital and digital-to-analog converters with 


coding and decoding of a digital data strean. 


A. ANALCG S¥STENS 


Analog systems are used in voice security systems more 
extensively than digital systems. They are characterized by 
balanced mixers, oscillators and filters. The various types 


are discussed below. [8] 





Inversion is the nane for a Scrambling process that 


provides security Ey systematic modification of a voice 


Signal before transmission. In its Simplest form it 
interchanges low voice Frequencies and high voice 
frequencies. It operates by changing each frequency 


component present in a voice signal to a new frequency, 
where the new frequency is the difference between the 
original frequency and the reference or inversion frequency. 
For example, at a reference freguency oft 3000 hertz, a voice 
component at 750 hertz would he converted to a component at 
Beeo-750 hertz or, 2250 hertz. A scrambled signal must be 
unscrambled at the receiving end using a second inverter of 
the same reference frequency. When the scrambled 2250 hertz 
freguency 1s subtracted from the reference frequency, 750 
hertz, the original frequency voice component is restored. 
The €ase of unscrambling inverted speech makes this system 
vulnerable to unsecure trénsmisSion. An eavesdropper need 
only use an inverter with an adjustable reference frequency 
oscillator, tuning the oscillator until the speech is 
intelligible. Moreover, With concentrated attention, 
inverted speech can be learned directly in about four hours. 


mugmce | Gepicts inversion. [8] 
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Figure 1—- INVERSION 
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A more secure method of scrambling either divides 
the 300-3000 hertz voice band into several subbands, or it 
inverts them, or both. This iS Known as band-splitting. 
Unscrambling is achieved by interchanging the signals in the 
subbands and reinverting then as required. Its advantage 
over the sinple inversion technique is that many different 
code settings are possible according to how the different 
sukkands are rearranged in the scamnbling process. Unlike the 
inversion technigue, one cannot learn to directiy understand 
mime Scrambled output of a band-splitter. However, by 
repeating a message several tines, many of the words can be 
unscrambled by the human ear. It is also possible to 
eavesdrop by using eguipment that returns just one cf the 
subbands to its prcper place thus rendering this methed 
Susceptible to relatively simple attacks Figure 2 depicts 
momea-Solittirg. { 8] 





Relative Power 
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1000 2000 S000) Herirz 


(a) Original voice signal spectrum 





1000 2000 3000 Hertz 
(b) Serambled voice signal spectrum; (note 


both displacement and inversion of the bands) 


Figure 2 - BAND-SPLITTING 
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SeCUbity CL tered by inversion and band-splitting 
technigues can be enhanced by adding extraneouS tones oF 


mepkse in a Scrambler to mask the volce. These additions must 


be filtered out by an unscrambler. Pica Leh Tener y sot 
mee tering is aggravated by the presence of harmonic 
distortion in transmission systems. Such jsGlScOrt lon willed 


generate noise and tones at new frequencies where they 
cannot be removed without also removing some of the voice 
signals. Consequently, masking increases the security of 
voice communication while reducing message intelligibility 


by the intended listener. 


4. Time Domain Systems 


Another security method leaves a voice signal in its 
Original freguency components but divides a signal into tine 
elements, transmitting the various elementsS in a rearranged 
sequence. This is known aS a time domain system while the 
previous systems operate in the frequency domain. The time 
domain of an inversion system would generate speech in 
reverse order of time. This is not normally done in practice 
because a Scralipbier would have to wait until a complete 
message waS expressed pefore it could be transmitted in 
reverse. This involves delays in the communications prcecess. 
A better method is to divide the message in to small 
distinct tine segments to delay for varying brief intervals 
berore reprcducing them. This mixes the order of the voice 
segments while making the output unintelligble without 
compatible eyuipment. fo date, these systems have required 
large, exrensive magnetic recorders. At present there are 


no time dcmain scramblers on the U.S. market. 
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The mwethods discussed thus far were assuued to use 
fixed codes. Using a continually changing code sequence at 
different times increases the difficulty cf inteiligible 
miLcuception myean cavesdropper unless he had access to the 
specified ccde sequence being used at a particular time. In 
a properly designed system enough different codes can be 
used so as to make it impractical for an eavesdroprer to 
find the correct one. This method is enhanced even further 
if the code is changed as often as each day. This system 
reguires frequent dissemination of particular codes tec the 
users and security precautions to ensure that the codes do 


not fall intc unauthorized hands. 


B. DIGITAL TECHNIQUES 


few duality of digital transmissions is affected by the 
number of encoding levels in the bit stream. Tne larger the 
number of levels, the more bandwidth is required to transait 
the signals. Pandwidth being a scarce commodity, an increase 
in bandwidth usage will result in an increase in cost. 
Somventicnal digital voice security systems are 
characterized by by use of larger amounts of bandwidth 
(wideband) than analog voice security systems (narrowband). 
They are conseguentiy more expensive than analcg systems. 
Nonetheless, digital systems have technical advantages, 
especially when sending vast amounts of data over long 
distances where storing of the data 1S required. Sending 
analog signals over iong distsances reygquires expensive lines 
and high guaility radio links. Long telephone iines_ can 
Gause drastie attenuation in high frequencies. This effect 


Can be overccite only by the use of costly coaxial or other 
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special cables. In contrast, even if transmission quality is 
poor, a digital system need only detect whether a "0" or a 
"it was sent. It need not detect complicated and detailed 
waveforms of speech. Much poorer Signals can be acceftably 
decoded and no noisSe or interference is present in the data 


sent; the output will be noise and interference free. 


Another method of telephone trananission converts sounds 
into a stream of bits (pulse code modulation) and eliminates 
need for the costly modems required in analog systenus. [16] 
The pulse code modulation (PCM) technigues being constructed 
equate to one channel becoming the equivalent of 56,000 bits 
per second in each direction. During ah experiment in 
Europe by HKartin [16] a computer, a teletype machine, anda 
telephone line as the communication link took place. During 
one observation an analog voice line capable of transmitting 
4,800 bits per second was used. In a half hour it coulcd 
mermsomit i800 x 4,800 or 8,640,000 bits. In fact, it had 
only sent 21,000 bits of data. Its efficiency was 21,000/ 
8,640,000 or .0024, a poor use of an expensive faciiity. 
Vcice lines use FCN technigues in which one telephone 
Channel Eeccmes equivalent ta 56,000 bits per second in each 
direction. This bit stream could conceivably transmit 1,800 
x 56,000 bits in a half hour in each direction. Using tine 
Sharing, the transmisSion efficiency could then be said _ to 
moe 1 ,00072 x 1,800 x 56,000 or .0001. If we could push the 
efficiency up to .25 we would have an improvement of 2,500 
times. On an analog voice line used at 4,800 bits per 
second, a one hundred fold improvement would result. This 


can be accomplished through time sharing. [ 16] 


Analog storage is bulky and cumbersome, requiring tapes, 
and discs that are susceptible to damage loss, or 
Misplacement. The use of digital storage techniques cffers 
the user large amounts of storage capacity, fast access 


time, processing in real time, and minimal human handling. 
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A new system proposed here reduces PCM reguirements from 
56,000 bits per second to a mere 200 bits per second. The 
frereased advantage goes from an efficiency of {in the 
Bxanmple cited) -0024 to 3000 x 7/1,800 x 200 or .0583. The 
bit rate reduction is over 400:1. In general, a decrease in 
bandwidth decreaseS expense. Therefore, restricting the 
bandwidth decreases expense. This bandwidth reduction 


technique can be applied to voice security systems. 
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Ae REASCN FOR DIGITAL 


In ancrmal speech there is a considerable redundancy of 
expression which means that more synbols are transmitted 
than are required to communicate the information. Parts of 
words and often whole words frequently are not required to 
communicate effectively. Navai messages typically are 
reduced to acronyms, abbreviations, and shortened words 


Which are understandable to the communicating parties. An 


example of redundancy 1S *ueue"® in "queue", The "ueue"™ 
sound altays follows the "q" and therefore the "ueue" is 
meaundant. "Ihe as also frequently redundant. Most 


Eedundancy resuits from rules and limitations placed on 
languages, excluding usable combinations of letters. eel 
language permitting any pernutation of four letters to be a 
Wend, Such as “"ngwv", then 456,976 words, or approximately 
the number of vordS in an unabridged dictionary, woulda 
exist. The English language prohibits a combination such as 
ngwv", rendering it more redundant than the hypothetical 
Four letter language. Limitations on vocabulary add to the 
waste. A child's use of the word "play" may be changed by an 
adult to "frclic" or "amusement". It is more redundant for 
someone to “accomplish something" than it is for somecne to 
"do something", According to Shannon, [24] two extremes of 
redundancy e¢xist: one extreme iS in use of additional whole 
words to add emphasis to an idea; the other extreme iS in 
Superfluous inflections and drawls of various dialects. 


"The basic English vocabulary is limited to 850 words and 
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the redundancy very high. This is reflected in the expansion 
that occurs where a passage is translated into hasic 


pagilish." 


Rules, limitations, formalities, and the desire to 
modify language and speech create redundancy. AS a result, 
English is about 75% redundant which is to say only 25% of 


English text 1S necessary 1f£ it were wholly nonredundant. 


[ 10 J 


In information theory entropy must be eliminated from a 
svsten. This is precisely what the proposed digital systen 
does. By reducing the input data to a nonredundant level, 
much useless data is discarded while working parts are 
retained. As an example, data is transferred at the rate of 
60 kiiobits per second in Bell Telephone digital links with 
enormous redundancy since human Speech conveys meanings at 
the rate of cnly a few hundred bits per minute. Speech as 
an audio signal is limited to a data rate of only several 
hundred bits per second. The proposed digital system 
transmits speech virtually without redundancy. Known as 
VOCON, (Voice CCiimunication through COMpression and 
COMputation) it sends a series of digits that instruct a 
Syntnesizer tc recreate speech, instead of sending the 
Original speech waveforms. Only several hundred bits, a 
fraction of the real amount, need be sent. [25] Because the 
human ear is sensitive to amplitude and freguency changes, a 
VOCOt processed signal will vary from the original yet still 
have sufficient guality to be intelligble. [17] Potential 
losses from data compression include recognition of who is 
speaking, transmission of the emotional content, and 


monversational effort. 
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Pee botGN OF THs VOCOd SYSTEM 


The design of a voice security system must be compatible 
With existing radio telephone equipment. It must also 
provide a reasonable amount of privacy against not only the 
"sncoper" on the RF channel but also against the loss or 
compromise of equipment. Finally, it must be reasonable in 
Moet, The censtraint of making the security system 
compatibice with existing equipment restricts the 


Goumunication channel to 300 to 3000 hertz. 


A digital ccnmputer has been designed to receive a 
continuous electrical Signal and transform it into data at a 
comparatively slow rate for input into a general purpose 
processor. The machine uSesS digital circuits throughout to 
compute instantaneous values of frequency and power in real 
time. This macnine is a special purpose digitai computer 
capable cf various modes of transformation through the use 
of Fourier transforms. This makes it easily applicable to 
the analysis of ‘Speech. Tt also allows hardware to be 
time-shared among several filters capable of examining 
components in any band CeBeiiee Ud! prc eopece bun. the digieat 
fjeeer Dank, consisting of 128 filters, is capaonle of 
providing 10 octaves of data at Semitone intervals to the 
fegecal cScillator bank with 64 oscillators. Although it 
Was designed for transmitting music, its potential is great 
in secure voice applications. (Contact One future 
reference: Newent omecr Perce, Eisetronics “Music Studios, 
277 Putney Bridge Road, London, SW152PT England.) 
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C. HARDWARE 


The Nheart" of the system reiles upon the PDP-8 
Minicomputer Manufactured by Digital Equipment Corporation 
Meboenain Street, Maynard, Massachusetts, 01754). It uses a 
12-bit word length intended for laboratory and process 
control applications with original system prices of 
approximately $28,500. [ee jeer EGQuarad by not called a 
minicomputer, it rapidly became very popular and soon Lbecane 
recognized as the mies © mass-produced and pepular 
Minicomputer and the first computer to sell for less than 
$20,000 (CPU only). A memory instruction can reference any 
of 128 aidresses on itS own page, or any of 128 addresses on 
other pages. With indirect addressing, any locaticn in 
memory can ke referenced. These 128 addresses coincide with 


the 128 filters and 64 oscillators in the vVOCcOo% system. 


The digital hardware consist of two PDP8 computers, a 
disc file, and a fast paper reader/punch with an attached 
Magnetic tape drive. The input and output system makes it 
SUitable for real-time appiications. A crystal clock in the 
interrupt line delivers synchronizing pulses at 400 hertz or 
a sub-multipfle of this frequency. There are also 10 
Kilohertz digital-to-analog and analog-to-digital converters 


for visual purposes. [30] 


mcm COMpPUterS control the pitch, timing, amplitude, and 
waveform through three panks The computers also control the 
gain and respose mode of 64 narrow passband filters placed 
at semitone intervals over five and one half octaves. Nine 
other oscillators and function generators, six amplifiers, 
two variakle response filters, and a number of other dévices 


such as noise generators are also controlled by the 
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computers. Most of the connections are done manually at a 
patch panel but up to twenty or them may be connected 


through compater controlled audio switches. [9] 


There must first be an analog signal from a telerhone, 
radio or cther sound source. The signal is fed through a 
series Or specialized filters and compressors to the 
analysis section of the VOCOM receiver still in basic audio 
Signal forn. iene msne kt step a diyital analysis cccurs 
where a number of parameters may be varied which determine 


the amount of digital data to be stored or transmitted. 


These varying parameters are rate analysis, normally 
20-30 times a second for speech; the numper of points on the 
frequency <¢pfectrum to be Sampled; and the precise 
freguencies for each of these points. Up to 64 individual 
Frequencies (individually) can be analyzed ranging from 0-16 
kKilohertz. For each of these points, up to 64 levels may be 
detected thus allowing a large amount of data to be absorbed 


by the receiver. [239] 


The principle of operation is Simple: (1) an = anaiog 
Signal is transmitted, (2) it 1s analyzed by means or a 
Special versicn of a fast Fourier transform, (3) it 1s 
rearranged so as to only resemble the original contents, and 
(4) it is retransmitted as a Series of instructions to the 
VOCOM receiver which then (5) — FeECONnVErtS it =tor an 
understandable analog signal. fhis system is unigue in that 
both the receiver and the transmitter are computers. It is 
hot data that 1s transmitted as much as it is precise 
instructions. Because these instructions require littie data 
Eomcatse very large changes in the final output, a data 
reduction is possible. This synthesizing machine is 
programmable in waveform, frequency, amplitude, and time of 


Change of frequency and amplitude. 
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D. SOFTWARE 


Specially developed programs called VOCAB and 4H4uUSYS 
Below xvceal-time transformation into computer instructions. 
Mewtcnis point data reduction eliminates redundant speech 
Babes and keeps the meaningful parts. For example, if a 
spoken word is drawn out it 1S not necessary to continue the 


sound every instant until 1t is complete; rather, it is only 


necessary for a computer instruction to say "continue this 
moumde at this rate for a certain period". The VOCOM 
Sapebility ck Storing, mixing, and COlwrnui ng sounds 
increases the security of voice communications. Other 


reductions pick out peaxs in the data, calculate variations 
in the frequency and amplitude and identify the sound source 
such as a telephone. Telephone identification cffers 
Mmimtaer Lreauction since only the bandwidth of the source 1s 
reguired. Data reduction eliminatesS normal pauses and gaps 


in speech. AS an example of the instructions: 


Eiomnothing £or .23 seconds. 


tkeep on going like you are for .11 seconds." 


"Change the frequency 230 hz over .13 seconds at rate Xx 


fer: Siience.* 


(rt us the end of a sentence. drop the overall pitch for 
10 seconds at rate 4." [30] 


The digital data between the receiver and transmitter 
Cah be transmitted at a rate of less than 1000 bits per 
second for speech and higher if it becomes necessary. It 1S 


estimated that telephone speech can be transmitted at 200 
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bits per second, representing a data reduction when 


transmitting. 


To reiterate pertinent points: a digital computer is 
able to program 64 oscillators, each capable of producing 
three periodic waveforms at any amplitude. In theory the 
system 1s capable of reproducing any sound. Amplitude and 
freguency change is separately defined for each oscillator. 
A crystal clock can communicate to and fron the computer 
giving interrupts at appropriate programmable intervals. 
Three output amplifiers can be digitally controlled for 


overall dynamic changes. [29]} 


E. COs P CONSIDERATIONS 


This system is not only reliabie but also inexpensive. 
Hardware prices quoted by Digital Corporation, the 
manufacturers of the PD?P8 minicomputer [3, 13] indicate that 
the hardware can be purchased for approximately $40,000. the 


figures listed below are within ten percent accuracy. 


PDP8 Computer $7,600 (2 required) 
Memory Box 5 00 0 

Disc File $3,950 (2 required) 
Paper Tape reader/Punch $4,200 

LA 36 Terminal S2ENTS 

Cabinet $850 

Bootstrap Loader $ 500 

Peitack Magnetic Tape Unit $11,500 

Crystal Clock $ 400 
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15 Channei DigitalyAnalog Converters $1,000 (2 required) 
Total Di On | we 


Compared to hardware components of another system such as an 
eguivalent IE™M% 360 system to accomplisn the same job, the 
cost to purchase the system iS eStimated to be $200,000. 
[20] Although the IBM 360 has a greater capability than the 
PDP8, this comparison is meaningful since iarge computers 
are used or Similar applications. Use OF other 


minicomputers could yieid similar results of the PDP8. 


Some hardware components differ significantly in price 
and would seem to be unjustified until further inspection is 
made. For example, in an analog system, an analog multiplier 
would cost approximately $15 while ina digital systen a 
digital multiplier would cost about $125; an important 
Meweantage exists in the use of digital multipliers in that 
mieyecan be time-shared anong the 64 oscillators at a cost 
Savings of 600%. Additionally, the steadily decreasing costs 
of large scale integration devices makes digital equipment 


ancreasingly attractive. [6] 

The overali system offers enormous savings in the 
channel capacity needed for transmitting voice signals for a 
fractional increase in the cost of the terminal equirment. 
For assessing costs and benefits the following factors, 
uSing a Single line, must be considered: 

V=CxLxF 

C cost of line per mile 


L length of line in miles 


Vevocod termindt cost 
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F compression factor 


GP. HYPOTHETICAL APPLICATION 


Currently, ncrmal 3000 hertz voice grade telephone lines in 
the United States cost $5.48 per month per mile. [22] These 
lines aré unconditioned , private, leased, and capabie of 
mae duplex operation. They are also capable of carrying 
either voice cr data signals and are single channel. On 
short lines, of, say, less than 100 miles, the cost of 
terminal eguipment dominates while cn long lines, the iines 
themselves determine the costs. To illustrate an example, a 
Singie line from the Naval Postgraduate School to 
Washington, £.C., a distance of approximateiy 3000 miles, 
moma cost $5.48 x 3000 or $16,440 per month. tultiplying 
this by 13, the number of autovon iines at the school, the 
Cost goes to $213,720. It 1S possible, through use of the 
VOCOM system, to use one Single existing line and through 
multiplexing, still have the equivalent of 13 full duplex 
lines. This cost savings in line usage alone would amount 
memols?,2800. En actual practice the school only pays for the 
terminal eguipment at the switchboard and the major expense 
for the autovcn is borne by COMANAVTELCOM. 


The autovon lines are actualiy switched at a switching 
center at Lodi California, a distance of approximately 76 
air miles. The thirteen lines going to Lodi cost monthly 
$5,460, while if only one line were used the cost would be 
Seimeys 2420, a Significant savings. Thirteen channels could 
conceivably, Ly using the VOCOM system, reduce the line 
costs at aesavings of $5,040 per month. One VOCOM system 
CostS apfroximately $50,000 and would, through annual 


Savings, fay for itself in less than one year. 


26 





More importantly is the fact that the secure voice 
feature could be added at a relatively inexpensive price. 
measuring the actual value of the Secure voice capability is 
difficult Since the urgency of tne needed information is 
subjective and requires higher level deciSion-making kefore 
action can be taken. The value therefore cannot be expressed 
in dollars but rather in time savings. In this example 
classified information could be excnanged between personnel 
from the Naval Postgraduate School and personnel fron 
Washingtcn D. C. in real-time aS opposed to the typically 
delayed two or nore weeks which it takes to classify, and 


mail the information. 


A simple, Single iine user serves to show potential 
Savings on line costs alone. A single line is capasle of 
handling 4,800 bits per second. [10] By reducing the amount 
of data reguired for voice to 200 bits per second, the 
Peeemcial for 24 channels exists. Line conditioning, a 
modification to lines allowing increased data handling 
capability, increases further the handling capacity of this 
system. AS an example, a 96 Channel VOCOM unit could ce put 
in to service with the new 96 channei Bell D2 channel bank 
enabling the equipment to operate on a Single T1 line which 
otherwise haS a capacity of only 24 channels. This immediate 
MeaumrtOolad bcost to line profitability could be further 
increased up to 100 times the previous traffic depending on 


voice fidelity and time allocation requirements. 
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1975 and projected 


ne Luding 


hardware, 


are illustrated below. 


Hg75 


woo lines 
10 lines 
1 line 
1980 

100 lines 
10 lines 


ie 2ane 


These fidgures are for use with trunk 
the number of lines increases, 


augmilricanty. 


1980 
software, 

1 OS 
system systems 
pea. S25 0U 
$15,000 Oy 20 
$50,000 eo, U0 C 
see 7/310 pl Zou 
$10,000 ‘Ted pay) 
$37,500 $15,000 


The system could be used as shown in 


costs 


of the VOCOM systens 


(25 jeand Other direct costs 


1000 


Syscens 
Slee oO 
ba, (50 


$25,000 


$750 
be 010 


$10,000 


telephone lines. As 


the amount of cost decreases 


figure 


3. Figure 4 shows cost/benefit relationships. 
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Original development and cost of the VOCOM software is 
derived from the total cost of one system ($75,000) less the 
cost of the hardware ($40,000) or approximately $35,000. 
This sunk cost in technology and development will be reduced 


as the number of systems increases. 


Ge. SECURING THE SYSTEM 


There exist today numerous voice coding systems. Various 
modifications to modulation techniguesS incorporate sone 
degree of bandwidth compression in the encoding process. 
However, no digital coding scheme satisfactorily encodes 
Speech at less than 19.2 kilobits per second; thus, to 
achieve a 9.6 kilobits per second (maximum amount of data 
handling capacity on present lines} voice digitizer, an 
additional bandwidth compression of at ieast 2:1 must first 
take place. {8} These systems obviously rejuire wider 
bandwidth parameters than are necesSary with the VOCON 
systen. They also attempt to reproduce all the elements 
(accent, drawl, emotion) unigue to the speaker but are stiil 


not successful at exact duplication. 


Instead cf reproducing all theSe elements, the VOCOH 
System recreates and produces a sound by using coded data of 
much reduced density. The sound is not a reproduced exact 
sound of the speaker but a synthesized recreation of it 
using sophisticated sound generators. For the individual 
user, a terminal box must be available which is simply a 
Small portable box similar to an electronic calculator which 
Miil attach to a moRmall telephone handset. This terminal is 
forecasted to cost approximately $30 but it is expected that 
the user will rent the terminal along vith other user 


Services such as a supply of confidential code numbers and 
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MmeectOries fOr a particular user or group of users. By 
keying specified code numbers on the terminal box the user 
gains access to the VOCOH system and can speak through his 
terminal box in a secure fashion. The confidential code can 
ensure complete security by both changing codes at regularly 
scheduled intervals (hourly, daily, randomly, etc.) and can 
further be scrambled by changing codes automatically. In the 
VOCOM system any of the methods described in the 
introduction could be used. It has the further advantage of 
Poet i~Sscrambling “" from user to user. (27}] It would be 
possible for example to have a conference call with several 
people talking and none of them receiving the same exact bit 


stream because of their own personal codes. 


Software such as clumsy and buiky keylists which 
currently exist in the military could be reduced bcth in 
Size and in complexity. Fach authorized command or person 
could receive monthly codes instead of programming keystream 
generators or crypto boards. He would use his code for the 
Pomeereutdr period, punch it into the terminal, and commence 
communicating securely. A Naval operational commander (or 
anyone with proper clearance and need to know) could get 
real-time resolutions to problems instead of waiting for 
misunderstandings in nessages, incomplete or delayed 


messages, and mistakes in nessages. 
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ITV. CONCLUSTONS 


The need for secure voice communications 1S ever 
/Mietcasing both in the military and in civil circles. More 
importantly, the need to acguire this capability while at 
the same time reducing costs and improving performance is 
highly desirable. Present systems in use today are for the 
most part analog systems which require large usage of 
bandwidth, expenSive equipment, and cumbersome, bulky, and 
awkward software support. Large bandwidth usage 1s 


Virtually syncnynous with large costs. 


In the military there is a great need for secure voice 
communications and prohibitive costs limit the nunker of 
available secure voice terminals. Both voice quality 
improvement and quantity increases are necessary in today's 
Military forces. Most presently used systems operate in the 
electromagnetic spectrum above the HF range and as a 
consejuence, much communications takes place on uncovered 


Seeeits which tends to “leak” classified information. 


Present voice coding methods have room for improvement. 
Methods are available to both decrease costs waile at the 
Same time increasing performance. Because of the 
advantages, digital techniques will be employed in the 
future for voice communications. The system proposed here 
haS numerous advantages because of digital techniques and 
the potential for changing the whole method of 
communicating. The most difficult problem which remains 1S 
transferring modern technology and implementing these 
technigues. The proposed system is readily adaptable to 


both presently used analog lines as well as modern, 
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conditioned, high data lines. 


Costs of the proposed system can be reduced over a 
Period of time and With increased numbers of the systems, 
costs continue to decrease. Potential savings in line usage 
alcne have been discussed and it has been shown how the 


system could conceivably pay for itself over time. 


The system is already in itself secure. Communications 
security precedures and publication nandling could be 
designed to integrate with present procedures. An additional 
advantage exists in that current software requirements for 


Searing, keylists, etc., could be reduced Significantly. 


Presently used voice coding Systems have a 
SwiGatOnization process which they go through prior to 
eStaplishing communications. With the proposed systeag, 
synchronous linking would be established at the patch panel 


€lther manually, or through previously programed nethcds. 


Bandwidth uG a technical resource which must be 
conserved. This proposed system offers a method Or 
conserving FE-andwidth by increasing the efficiency of its 


usage. 


For military applicaticns there exist some apparent 
disadvantages. Most important is in the oscillators 
themselves. Typically, oscillators tend to drift in 
frequency and tuning accuracy is difficuit. Maintenance of 
prceperly tuned and stabilized oscillators may ke an 
expensive and unforseen cost which could affect the overall 


system significantly. 
Duplicate systems would have to be available in critical 


coOlimunications links. If only one system were in operation 


anc the systen became disabled because of loss of oscillator 
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Peequency Stabilization or for any other reason, a back-up 


system would be required thus further increasing costs. 


A further disadvantage exists in that an alarm system to 
warn the user if he is actually talking in a secure fashion 
would be required. AS the System exists, there is no method 
of determining whether the equipment is in fact operating in 


either a Secure Or an unsecure mode. 
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V. RECOMMENDATIONS 


A continuing investigation of this proposed voice coding 
system is needed. Appiication to telephone lines have been 
discussed cut there remains potentially dollar and bandwidth 
Savings to be realized in HF, UHF, sateilite, and even in LF 
and VLF applications. Use of the proposed system in these 


areas requires further investigation. 


The proposed system should pe officially investigated by 
the Navy in an in depth feasibility study. To easé the 
Sonpelcactions Of COmmunicating With Electronic Music Studios 
in London, it would be beneficial for some military unit 
there (not necessarily limited to the Navy) to make an 
OM—site€ investigation and study. The Office of Naval 


Research Londcn would be the prime candidate. 


Since the system has proven itself in the civilian 
arena, it shculd be demonstrated and applied using military 


peripheral eguipment. 


The use of a programmable, changing and flexible random 


code changing device needs to be investigated further. 


mesting Cf privacy and intelligibility, flexibility anda 
security must be coordinated with the National Security 
Agency tc find if standard requirements can be met. 


Reliability tests should be included. 
Further design modifications should be considered for 


the use cf microcomputers and other Large scale integration 


devices for rfotential use as portable equipment in field 
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orerations. 


It is realized that Electronics Music Studios literature 
haS provided a large portion of the data for this study and 
in that context it may be somewhat biased. The United 
States spends annually millions of dollars {2} in research 
and develcpment of communications security equipment; it is 
hoped that some of that could be invested in this syster. 
in any event, the initial equipment exists and secure voice 
communication with bandwidth reduction has been realized. 
it 1s the hope of this author that future investigation and 
Naval interest will iead to a complete, or at Minimum a 


partial operational network. 
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